A new Markov model for Web access prediction

Authors

  • Dongshan Xing
  • Junyi Shen
Abstract

The Web is a distributed hypertext repository of information, which users navigate through links and view with browsers. The heavy Internet traffic resulting from the Web's popularity has significantly increased user-perceived latency. The obvious solution, increasing the bandwidth, is not viable, because we cannot easily change the Web's infrastructure (the Internet) without significant economic cost. However, if we could predict future user requests, we could put those pages into the client-side cache while the browser is idle. When a user requests one of the pages, the browser could then retrieve it directly from the cache.

Much current research has examined modeling and predicting user access behavior on the Web to improve Web prefetching [1, 2], enhance search engines [3], and understand and influence buying patterns [4]. To predict Web access, we need a method for modeling and analyzing Web access sequences. With this information, we can deduce future user requests. Some researchers have used traditional Markov models, which are often employed to study stochastic processes, to predict user access behavior [5, 6]. In general, they take the sequence of Web pages a user has accessed as input and build Markov models that predict the page the user will most likely access next. Venkata N. Padmanabhan and Jeffrey C. Mogul used N-hop Markov models to improve prefetching strategies for Web caches [2]; Ramesh R. Sarukkai used Markov models to predict the next page accessed by the user [7]; and Igor V. Cadez and colleagues used Markov models to categorize user sessions [8]. Peter Pirolli and colleagues, however, tested the performance of different-order Markov models for Web access prediction and found traditional Markov models to be inadequate for this purpose [5]. Therefore, we need a new Markov model for Web access prediction.

Our hybrid-order tree-like Markov model (HTMM) can predict Web access precisely, providing high coverage and good scalability. HTMM merges two methods: a tree-like structure Markov model method that aggregates the access sequences by pattern matching, and a hybrid-order method that combines varying-order Markov models. Performance evaluations comparing HTMM with traditional Markov models confirm its usefulness.
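As a rough illustration of the general idea of combining Markov models of different orders, the following Python sketch counts page transitions for contexts of one and two pages and predicts from the longest context seen in training, falling back to a shorter one otherwise. This is not the authors' HTMM algorithm, which the abstract does not specify in detail; the function names `train` and `predict`, the `max_order` parameter, and the sample sessions are all hypothetical and chosen only for the example.

```python
from collections import Counter, defaultdict


def train(sessions, max_order=2):
    """Count which page follows each context of 1..max_order pages.

    Hypothetical helper for illustration; not taken from the paper.
    """
    counts = defaultdict(Counter)
    for session in sessions:
        for i in range(1, len(session)):
            for k in range(1, max_order + 1):
                if i - k < 0:
                    break  # not enough history for a context this long
                context = tuple(session[i - k:i])
                counts[context][session[i]] += 1
    return counts


def predict(counts, history, max_order=2):
    """Return the most frequent next page, trying the longest context first."""
    for k in range(min(max_order, len(history)), 0, -1):
        context = tuple(history[-k:])
        if context in counts:
            return counts[context].most_common(1)[0][0]
    return None  # no matching context was seen in training


# Made-up access sessions, one list of page names per user session.
sessions = [
    ["home", "news", "sports"],
    ["home", "news", "sports"],
    ["search", "news", "weather"],
    ["home", "about"],
]
model = train(sessions)

print(predict(model, ["home", "news"]))     # second-order context matches
print(predict(model, ["search", "news"]))   # second-order context matches
print(predict(model, ["contact", "news"]))  # falls back to first-order
print(predict(model, ["contact"]))          # nothing matches
```

In this toy data, a first-order model alone would predict the same page after "news" regardless of how the user arrived there, while the second-order context distinguishes users who came from "search". The fallback to shorter contexts is what keeps coverage from dropping when a long context was never observed.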


Similar resources

A model for specification, composition and verification of access control policies and its application to web services

Despite significant advances in the access control domain, requirements of new computational environments like web services still raise new challenges. Lack of appropriate method for specification of access control policies (ACPs), composition, verification and analysis of them have all made the access control in the composition of web services a complicated problem. In this paper, a new indepe...


Integrating Markov Model with KNN Classification for Web Page Prediction

World Wide Web is growing rapidly in recent years. User’s experience on the internet can be improved by minimizing user’s web access latency. This can be done by predicting the next step taken by user towards the accessing of web page in advance, so that the predicted web page can be prefetched and cached. So to improve the quality of web services, it is required to analyze the user web navigat...


Marcov Models for Web Access Prediction

The problem of predicting user’s behavior on a Web site has fundamental significance due to the rapid growth of the World Wide Web. Although traditional Markov models have been found to be suited for addressing this problem, they have serious limitations. Thus, good predictions require new Markov models. Hybrid-order tree-like Markov models predict Web access precisely while providing high cove...


A Vague Improved Markov Model Approach for Web Page Prediction

Today most of the information in all areas is available over the web. It increases the web utilization as well as attracts the interest of researchers to improve the effectiveness of web access and web utilization. As the number of web clients gets increased, the bandwidth sharing is performed that decreases the web access efficiency. Web page prefetching improves the effectiveness of web acces...


Prediction of Land Use Change and its Hydrological Effects Using Markov Chain Model and SWAT Model

Access to current and future water resources is one of the concerned problems for managers and policymakers around the world. Because of the communication between water resources and land use, these two topics had come together in different researches. Scenarios designed in regional land planning provide the basis for analyzing the existing opportunities and making the right decisions for manag...



Journal title:
  • Computing in Science and Engineering

Volume: 4

Publication date: 2002